Clustering Time-Varying Gene Expression Profiles using Scale-space Signals

Author

  • Tanveer F. Syeda-Mahmood
Abstract

The functional state of an organism is determined largely by the pattern of expression of its genes. Analysis of gene expression data from gene chips has so far centered on clustering and classifying the data with machine learning techniques that use expression intensity alone and mostly ignore the time-varying pattern. In this paper, we present a pattern recognition-based approach that captures similarity by finding salient changes in the time-varying expression patterns of genes. Such changes can give clues about important events, such as gene regulation by cell-cycle phases, or even signal the onset of a disease. Specifically, we observe that dissimilarity between time series is revealed by the sharp twists and bends in a higher-dimensional curve formed from the constituent signals. Scale-space analysis detects these sharp twists and turns, and their strength relative to the component signals is estimated to form a shape similarity measure between time profiles. A clustering algorithm is presented that groups gene profiles using the scale-space distance as the similarity metric. Multi-dimensional curves formed from the time series within each cluster serve as cluster prototypes, or indexes into the gene expression database, and are used to retrieve genes functionally similar to a query gene profile. Clustering with the scale-space distance is compared extensively against clustering with the traditional Euclidean distance on the yeast genome database.
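The idea of reading dissimilarity from bends in a curve built from two profiles can be illustrated with a small sketch. This is not the paper's measure, whose details the abstract does not give. It is a simplified stand-in under stated assumptions: the curve is the planar curve (a(t), b(t)), "sharpness" is the turning angle between consecutive segments, reversals along the same line are folded to zero so that identical profiles score zero, and scale-space is approximated by Gaussian smoothing at a few hand-picked scales. The names `gaussian_smooth`, `turning_angles`, `bend_distance`, and the parameters `sigmas` and `min_angle` are all hypothetical.

```python
import math


def gaussian_smooth(signal, sigma):
    """Smooth a 1-D sequence with a Gaussian kernel, repeating the end samples."""
    if sigma <= 0:
        return list(signal)
    radius = max(1, int(3 * sigma))
    offsets = range(-radius, radius + 1)
    kernel = [math.exp(-0.5 * (k / sigma) ** 2) for k in offsets]
    total = sum(kernel)
    n = len(signal)
    smoothed = []
    for i in range(n):
        acc = 0.0
        for k, w in zip(offsets, kernel):
            j = min(max(i + k, 0), n - 1)  # clamp indices at the borders
            acc += w * signal[j]
        smoothed.append(acc / total)
    return smoothed


def turning_angles(xs, ys):
    """Folded turning angle (radians, in [0, pi/2]) at each interior curve point.

    Assumption of this sketch: a reversal along the same line counts as no bend,
    so the raw angle is folded with min(angle, pi - angle).
    """
    angles = []
    for t in range(1, len(xs) - 1):
        ux, uy = xs[t] - xs[t - 1], ys[t] - ys[t - 1]
        vx, vy = xs[t + 1] - xs[t], ys[t + 1] - ys[t]
        if (ux == 0 and uy == 0) or (vx == 0 and vy == 0):
            angles.append(0.0)  # a zero-length segment has no direction
            continue
        raw = abs(math.atan2(ux * vy - uy * vx, ux * vx + uy * vy))
        angles.append(min(raw, math.pi - raw))
    return angles


def bend_distance(a, b, sigmas=(0.0, 1.0, 2.0), min_angle=0.3):
    """Average, over smoothing scales, of the summed sharp bends of (a(t), b(t))."""
    score = 0.0
    for sigma in sigmas:
        xs = gaussian_smooth(a, sigma)
        ys = gaussian_smooth(b, sigma)
        score += sum(ang for ang in turning_angles(xs, ys) if ang >= min_angle)
    return score / len(sigmas)


if __name__ == "__main__":
    rising = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
    peaked = [0.0, 1.0, 2.0, 2.0, 1.0, 0.0]
    print(bend_distance(rising, rising))  # identical profiles: no bends
    print(bend_distance(rising, peaked))  # different shapes: bends appear
```

Because the angle is folded, this stand-in is blind to anti-correlated profiles (b = -a also traces a straight line), which is one reason it should not be read as the scale-space distance itself. A distance of this kind could then be passed to any clustering routine that accepts a custom pairwise metric.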

Similar resources

Clustering Time-Varying Gene Expression Profiles using Scale-space Signals

The functional state of an organism is determined largely by the pattern of expression of its genes. The analysis of gene expression data from gene chips has primarily revolved around clustering and classification of the data using machine learning techniques based on the intensity of expression alone with the time-varying pattern mostly ignored. In this paper, we present a pattern recognition-b...

Clustering samples characterized by time course gene expression profiles using the mixture of state space models.

We propose a novel method to classify samples where each sample is characterized by a time course gene expression profile. By exploiting the mixture of state space model, the proposed method addresses the following tasks: (1) clustering samples according to temporal patterns of gene expressions, (2) automatic detection of genes that discriminate identified clusters, (3) estimation of a restrict...

Investigating the effects of human papillomavirus-induced changes in cellular microRNA expression in head and neck squamous cell carcinoma cells at the gene expression profile level

Background and aim: Human papillomavirus plays an important role in some human malignancies and alters the normal expression levels of cellular microRNAs. In this paper, we evaluated the effects of such changes on head and neck squamous cell carcinoma tumor samples at the gene expression profile level. Methods: In this descriptive-analytical study, gene expression profiles of 36 tum...

Dynamic Modelling of Microarray Time Course Data

The analysis of gene expression profiles, obtained from DNA microarray experiments, is used to discover relationships between genes and to discern groups of genes involved in common processes. The principal aim of this paper is to introduce dynamic modelling of microarray time course data. A novel approach to identify similar gene expression profiles is presented. Using parametric modelling, we de...

Mesenchymal Stem/Stromal-Like Cells from Diploid and Triploid Human Embryonic Stem Cells Display Different Gene Expression Profiles

Background: Human ESCs-MSCs offer new insight into future cell therapy applications due to their unique characteristics, including immunomodulatory features, proliferation, and differentiation. Methods: Herein, hESCs-MSCs were characterized by IF technique with CD105 and FIBRONECTIN as markers and FIBRONECTIN, VIMENTIN, CD10, CD105, and CD14 genes using RT-PCR technique. FACS was performed fo...

Journal title:
  • Proceedings. IEEE Computer Society Bioinformatics Conference

Volume 2

Publication date: 2003